A novel method for constrained class association rule mining
نویسندگان
چکیده
To create a classifier using an associative classification algorithm, a complete set of class association rules (CARs) is obtained from the training dataset. Most generated rules, however , are either redundant or insignificant. They not only confuse end users during decision-making but also decrease the performance of the classification process. Thus, it is necessary to eliminate redundant or unimportant rules as much as possible before they are used. A related problem is the discovery of interesting or useful rules. In existing classification systems, the set of such rules may not be discovered easily. However, in real world applications, end users often consider the rules with consequences that contain one of particular classes. For example, in cancer screening applications, researchers are very interested in rules that classify genes into the ''cancer'' class. This paper proposes a novel approach for mining relevant CARs that considers constraints on the rule consequent. A tree structure for storing frequent itemsets from the dataset is designed. Then, some theorems for pruning tree nodes that cannot generate rules satisfying the class constraints are provided and proved. Finally, an efficient algorithm for mining constrained CARs is presented. Experiments show that the proposed method is faster than existing methods. Association rule mining and classification are two important and common problems in the data mining field. Therefore, numerous approaches have been proposed for the integration of these models. Examples include classification based on association rules (CBA) [23], a classification model based on multiple association rules [21], a classification model based on predictive association rules [41], multi-class and multi-label associative classification [34], a classifier based on maximum entropy [35], the use of an equivalence class rule tree [39], a lattice-based approach for classification [29], the integration of taxonomy information into classifier construction [6], the integration of classification rules into a neural network [20], a condition-based classifier with a small number of rules [11], an efficient classification approach with a rule quality metric [10], and a combination of a Netconf measure and a rule ordering strategy based on rule size [15]. The implementation processes of these approaches are often similar. Firstly, a complete set of class association rules (CARs) is mined from the training dataset. Then, a subset of CARs is selected to form the classifier. However, the complete
منابع مشابه
A Novel Method for Selecting the Supplier Based on Association Rule Mining
One of important problems in supply chains management is supplier selection. In a company, there are massive data from various departments so that extracting knowledge from the company’s data is too complicated. Many researchers have solved this problem by some methods like fuzzy set theory, goal programming, multi objective programming, the liner programming, mixed integer programming, analyti...
متن کاملFUZZY GRAVITATIONAL SEARCH ALGORITHM AN APPROACH FOR DATA MINING
The concept of intelligently controlling the search process of gravitational search algorithm (GSA) is introduced to develop a novel data mining technique. The proposed method is called fuzzy GSA miner (FGSA-miner). At first a fuzzy controller is designed for adaptively controlling the gravitational coefficient and the number of effective objects, as two important parameters which play major ro...
متن کاملAssociation rule mining application to diagnose smart power distribution system outage root cause
Smart grid has been introduced to address power distribution system challenges. In conventional power distribution systems, when a power outage happens, the maintenance team tries to find the outage cause and mitigate it. After this, some information is documented in a dataset called the outage dataset. If the team can estimate the outage cause before searching for it, the restoration time will...
متن کاملA Model of Immune Gene Expression Programming for Rule Mining
Rule mining is an important issue in data mining. To address it, a novel Immune Gene Expression Programming (IGEP) model was proposed. Concepts of rule, gene, immune cell, and antibody were formalized. The dynamic evolution models and the corresponding recursive equations of immune cell, self, immune-tolerance were built. The novel key techniques of IGEP were presented. Experiment results showe...
متن کاملMining Class-Association Rules with Constraints
Numerous fast algorithms for mining class-association rules (CARs) have been developed recently. However, in the real world, end-users are often interested in a subset of class-association rules. Particularly, they may consider only rules that contain a specific item or a specific set of items. The nave strategy is to apply such item constraints into the post-processing step. However, such appr...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Inf. Sci.
دوره 320 شماره
صفحات -
تاریخ انتشار 2015